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Abstract 

Motivated by tiie work of Uehara et al. pLj, an im- 
proved method to recover DC coefficients from AC coeffi- 
cients of DCT-transformed images is investigated in this 
work, which finds apphcations in cryptanalysis of selec- 
tive multimedia encryption. The proposed under/over- 
flow rate minimization (FRM) method employs an op- 
timization process to get a statistically more accurate 
estimation of unknown DC coefficients, thus achieving 
a better recovery performance. It was shown by exper- 
imental results based on 200 test images that the pro- 
posed DC recovery method significantly improves the 
quality of most recovered images in terms of the PSNR 
values and several state-of-the-art objective image qual- 
ity assessment (IQA) metrics such as SSIM and MS- 
SSIM. 

1 Introduction 

The discrete cosine transform (DCT) is an orthogonal 
transform with sub-optimal performance in terms of de- 
correlation efficiency [2^. Since DCT is easier to im- 
plement than the optimal Karhunen-Loeve transform 
(KLT), it has been widely used in signal and image pro- 
cessing applications, especially for lossy image and video 
compression. Many well known image and video coding 
standards, including JPEG and MPEG-l/2/4/H.26x, 
are based on 2D block DCT [3 . 

Among all DCT coefficients, the first one, called the 
DC (direct current) coefficient, plays the most impor- 
tant role. It represents the average intensity of a block 
and carries most of the energy and the perceptual infor- 
mation. Actually, DC coefficients of all blocks form a 
thumbnail version of the original image at a lower spa- 
tial resolution. When the block-DCT is applied to im- 
age/video compression, DC coefficients often consume 
more bits than other DCT coefficients called AC (al- 
ternative current) coefficients. To give an example, to 
encode the 8-bit gray-scale Lenna image of size 512x512 
using the 8x8 block-DCT and the default quantization 

* Companion Web page: h ttp : //www . hooklee . com/default . | 
|asp?t=AC2DC 



table of JPEG [4, Table K.l], we estimate that about 
16% of all coding bits will be used to encode quantized 
DC coefficients, where the estimation is obtained by cal- 
culating entropies of quantized DCT coefficients at dif- 
ferent locations. If we apply differential coding to DC 
coefficients, the ratio will decrease to 14%, which is still 
significant. 

Due to the importance of DC coefficients, it was ar- 
gued that encrypting DC coefficients only may effec- 
tively conceal significant visual information to achieve 
the goal of light-weight selective encryption [5 . For ex- 
ample, Li et al. [6 proposed to encrypt DC coefficients 
to conceal a rough view of MPEG-encoded video se- 
quences to achieve perceptual encryption. Other hybrid 
image/ video encryption schemes using the combination 
of DC encryption and secret AC permutations were pro- 
posed in [7 . However, since secret permutations are not 
secure against plaintext attacks [8 , the hybrid encryp- 
tion scheme can be downgraded to the DC encryption 
only 

Since DCT is an orthogonal transform, DC encryption 
is believed to be secure because DC coefficients are in- 
dependent of AC coefficients. It was however shown by 
Uehara, Safavi-Naini and Ogunbona [1 that most DC 
coefficients of a DCT-transformed natural image can be 
approximately recovered from AC coefficients. Their DC 
recovery method, called the USO method, exploits two 
properties of most digital images. First, there is strong 
correlation between neighboring pixels. Second, the DC 
coefficient of each block is constrained to an interval 
defined by the DC-free edition of this block, i.e., pixel 
values calculated only from AC coefficients. 

In this work, we propose an improved DC recovery 
method, which outperforms the USO method in recov- 
ery quality. Our idea is to minimize the under /over- flow 
rate of pixel values during the recovery process. We pro- 
vide experimental results to demonstrate the superior 
performance of the improved DC recovery method in 
terms of PSNR and nine objective image quality assess- 
ment (IQA) metrics in the MeTriX MuX Visual Quality 
Assessment Package [9], which includes SSIM [10], MS- 
SSIM IIII, IFC [la, VIF [13], etc. 

The rest of the paper is organized as follows. The USO 
method is reviewed in Sec. |2j The proposed DC recovery 



1 



method is described in detail in Sec. |3] Finally, the last 
section gives concluding remarks and future work. 

2 The USO Method 

In this section, we present the implementation detail of 
the USO method by following [T, where some missing 
details in the original paper are filled in with our best 
efforts. Since in [T] it was shown that the USO method 
is robust against quantization, in this paper we do not 
consider quantization. To facilitate our discussion, we 
denote the DC coefficient and the average intensity of an 
image block B by DC{B) and 5, respectively. We use 
the orthogonal form of 2D DCT so that BC{B) = N B, 
where N is the block size. Given a block 5, define B^^"^ 
as the block derived from B by setting DC{B) to i.e.. 
Bid) ^ (B-B)^d/N = B^{d-DC{B))/N. The valid 
range of pixel values is denoted by [tmin, ^max]- 

The USO method is based on the following two prop- 
erties of most digital images. 

Property 1 The difference between two neighboring 
pixels is a Laplacian variate with zero mean and a small 
variance. 

Property 2 The range of pixel values calculated from 
B^^^ (i.e., only from AC coefficients) constrains the 
value of DC{B). 

By exploiting Property 1, the unknown DC coefficient 
of a block can be estimated from its neighboring blocks 
with known DC coefficients by minimizing the differ- 
ences of adjacent pixels along the block boundary. Three 
patterns of adjacent pixels were considered in the USO 
method: one horizontal or vertical pattern, and two di- 
agonal patterns as shown in [l] Fig. 2]. The smoothest 
pattern (i.e., the one with the smallest gradient) is cho- 
sen for DC estimation. 

Property 2 can be used to determine a valid range of 
DC(5). Since t^in <B = 5^°) + J}C{B)/N < Wx, we 
obtain 

N(t^^ - min(5(0))) < DC(5) < 7V(Wx - max(5(o))), 

(1) 

which gives an estimation of DC{B) with an accuracy 
defined by iV(tmax - ^min + min(5^°)) - max(5(°))). 

Uehara et al. took the above two properties into ac- 
count and proposed the following method to recover DC 
coefficients of a DCT-transformed image with known AC 
coefficients. Without loss of generality, we assume the 
input image is a DC-free image, i.e., an image composed 
of blocks whose DC coefficients are all zerosH] 

Step 1: Choose a corner block of the DC-free image 
as the initial reference block Bq and estimate DC coeffi- 
cients of other blocks (relative to DC(5o) = 0) by scan- 
ning the whole image from Bq to the diagonally opposite 
corner block. Property 1 is used to estimate the DC of 
each block from its neighboring block. When there are 
two neighboring blocks, the two estimates are averaged. 

-"^In principle, the DC coefficients can be any fixed value. In 1 , 
the midpoint of the valid range (1023 for 8-bit gray-scale images) 
is used. Here, we use zero to simplify our description. 



Ideally, the output of this step is an image identical with 
the original image except that the average intensity is 
darker by 'DC{Bo)/N. 

Step 2: The goal of this step is to adjust the whole 
image so that the recovered image has the same av- 
erage intensity as the original one. First, calculate 
the valid DC range of each block Bi from Eq. Q. 
Then, for each DC range [(ii,min, <^i,max], get a range of 
the intensity adjustment as follows: [<^*min' max] — 

[(^iz,min - DC*(5,))/^,(^^,max - DC* (^^j/TV] ,' whcrC 

DC*{Bi) denotes the relative DC coefficient of Bi es- 
timated in Step 1. Next, calculate (i^-^ = max((i*^-^) 

i ' 

and (iJnax = max)- Finally, the whole image is 

brightened by {d^^-^ \ <ax)/2. 

Step 3: Repeat Steps 1 and 2 by choosing the four 
corner blocks as initial reference blocks, and average the 
images obtained from four different scans to get the final 
result. 

Step 4: Since the estimated DC values in previous 
steps may not be accurate, the output pixel values in 
Step 3 may not be in the valid range. Then, a post- 
processing operation, which includes the re-scaling of 
the whole image or the adjustment of under/over-fiow 
pixel values only, can be performed. 

No specific postprocessing operation in Step 4 was 
mentioned in [1 . We found from our experiments that 
the following post-processing scheme works well. 

• If the dynamic range of pixel values is larger than 
Wx - ^min, scale the whole image to [tmin, Wx]; 

• If the dynamic range of pixel values is not larger 
than tmax — ^min, adjust the average intensity of the 
image towards [tmin^^max] until all pixel values are 

within [tmin,^max]. 

The above scheme is therefore adopted in our implemen- 
tation of the USO method. For the four test images with 
reported PSNR values in [1], we obtained a higher aver- 
age PSNR value (21.7 dB vs. 20.9 dB) under the above 
setting to generate data given in p^, Table III-2]. 

3 Proposed DC Recovery 
Method 

One major drawback of the USO method is the low ac- 
curacy of the DC estimation process for some images. 
Small estimation errors in Step 1 may propagate to re- 
sult in a large number of under /over- fiows of pixel val- 
ues. These under /over-fiows can also affect Step 2 due 
to the inaccurately estimated DC ranges of some blocks. 

To illustrate this problem, we examine the 8-bit gray 
scale image in Fig.jlja). The four images obtained from 
four different scans (after Step 2) are shown in Fig.[2ja)- 
(d). Their pixel value ranges are [—83.0,338.0], 
[-90.3,345.3], [-92.0,347.0], [-136.3,391.3], respec- 
tively. We see that the error propagation effect is very 
serious. By averaging the four images, the pixel value 
range is [—88.6,303.0], which is still far from the valid 
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one. After scaling the range to [0,255], we get a better 
image as shown in Fig. [ijb). However, the quahty re- 
mains poor with a PSNR value of 14.3, an SSIM score 
of 0.732 and an MS-SSIM score of 0.711. 




Figure 1: (a) A test image of size 384 x 256 and (b) the 
recovered image using the USO method. 




Figure 2: Four intermediate images recovered from the 
DC-free edition of the test image Fig. [TJa) using the 
USO method. 



3.1 Under/Over-Flow Rate Minimiza- 
tion (FRM) Method 

To limit the error propagation effect, we apply Property 
2 during the relative DC estimation process in Step 1 as 
follows. For each block, after the relative DC estimation 
is obtained, we immediately check if the estimated DC 
coefficient is outside of the valid range. If this occurs, we 
re- adjust the estimated DC toward the valid range until 
all pixel values fall in interval [tmin, ^max]- After that, 
the scanning process proceeds to the next block. Since 
the DC-bounding process is done during the scanning 
process for all blocks, the output of Step 1 will contain 
no underflow or overflow pixel values. Then, Steps 2 
and 4 become unnecessary and can be removed. 

The modified DC estimation method can effectively 
limit error propagation. However, it encounters a new 
problem. That is, it may not be able to recover the 
DC coefficient of each block accurately if the estimated 
value of DC{Bo) is far from the ground truth. This is 



because the estimates obtained in Step 1 are actually 
DC coefficients of all blocks relative to DC{Bo). Thus, 
it is important to find an accurate estimate of DC(5o) to 
ensure that DC coefficients of most blocks are estimated 
with an acceptable accuracy. 

While there is no prior knowledge on the ground truth 
of DC{Bq)^ we can exploit a statistical approach to es- 
timate its value for natural images. It is based on an 
interesting phenomenon of the run-time DC-bounding 
process in Step 1: when the estimate of DC{Bo) is closer 
to the ground truth, under /over- flows in pixel values 
tend to occur less frequently. Although some counter 
examples exist, this observation holds for most tested 
natural images 

We use an example to illustrate the above observa- 
tion. For the test image shown in Fig. [ija), the rela- 
tionship between the under /over- flow rate and the esti- 
mated DC{Bo) is shown in Fig. [3] for four different scans. 
The under /over-flow rate is defined as Mfr/M^, where 
MpR is the number of blocks with under /over-flow pixel 
values and Mb is the total number of blocks in the im- 
age. In Fig. [3] the dotted line gives the ground truth of 
DC{Bq) while the dashed line gives the DC{Bq) value 
that has the minimal under /over- flow rate. When there 
are multiple points that have the minimal under/over- 
flow rate, the dashed line shows the mid-point of the 
range defined by the leftmost and the rightmost points. 
It is clear that the minimal under /over-flow rate always 
occurs around the ground truth of DC{Bo). 




Figure 3: The relationship between the under/over- 
flow rate and the estimate of DC{Bq) for the test im- 
age Fig. [TJa) where Subfigures (a)-(d) give results for 
Scans 1-4, respectively. 

If the best estimate of DC(5o) is used for each scan, 
we obtain the results in Fig. |4] By comparing Figs. |4] 
with [2) we see that the proposed FRM method performs 



^It is difficult to theoretically analyze the accuracy and sensi- 
tivity of the estimation process of DC(Bo). We conducted experi- 
ments on 200 test images to conffim its correctness and efficiency 
(see Sec. 3.3 for more detail). 
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significantly better. By averaging the four images ob- 
tained by four scans, we show the final recovered image 
in Fig. |5j This recovered image has a PSNR value of 
23.2, an SSIM score of 0.900 and an MS-SSIM score of 
0.924. 




Figure 4: The images obtained by minimizing the 
under/ over- fiow rate for four scans of the test image 
in Fig. [ija), where Subfigures (a)-(d) give results for 
Scans 1-4, respectively. 




Figure 5: The final recovered image using the FRM 
method from the DC-free edition of the test image in 
Fig. [3a). 



3.2 Computational Complexity 

The above FRM method is essentially an optimiza- 
tion process of the observable under /over-fiow rate dur- 
ing the DC estimation process. To get the best re- 
sult, an exhaustive search of all possible values of 
DC(5o) could be tried. When the step size is A, 
N{tmax - tmin + mm{BQ) - max(5o))/^ values of 
DC{Bo) are checked for each block in each scan, and 
the computational complexity of the FRM method 
is 0(MBiV(tmax-tmin + min(5*)-max(5*))/A) < 
0(MBA^(tmax -^min)/A). In coutrast, the USO 
method has a complexity of O(M^), so it is faster than 
the FRM method. 

To reduce the computational complexity, we notice 
that the relationship between the under /over-fiow rate 
and DC{Bo) is close to a uni- modal function. Thus, a 



binary-search strategy can be used to locate the opti- 
mal DC{Bo). At the beginning, the two ending points 
and the midpoint of the whole valid DC range are 
searched. Then, the DC range is reduced to be the 
sub-interval corresponding to the two smaller under- 
fiow/overfiow rates. This process is repeated until a 
pre-defined precision is reached. The computational 
complexity of such a binary search process is less than 
O {Mb log2(A^(Wx - tmin)/A)). 

Note that the computational complexity of the FRM 
method is 0{Mb) if max(5Q) — m.m{BQ) ^ tmax — ^min, 
which may happen for some images. In this case, it is 
as efficient as the USO method. 

3.3 Experimental Results 

To verify the effectiveness of the proposed FRM method, 
we conducted experiments on an image database of 200 
test images. Our MATLAB code and the test images are 
available at http://www.hooklee.coin/default.asp? 
t= AC2DC , To measure the quality of recovered im- 
ages, we compare the performance of the proposed FRM 
method with that of the USO method using the PSNR 
values and other nine IQA metrics in the MeTriX MuX 
Visual Quality Assessment Package [9]. The results are 
shown in Fig. [6j where the x-axis is the image index, and 
the y-axis is the difference of IQA scores and the dashed 
line shows the mean of the IQA differences of all images. 
A positive y-Yolue means the proposed FRM method 
outperforms the USO method. We can see that the per- 
formance improvement is consistent and significant for 
most images. Although PSNR and WSNR values be- 
come significantly worse for a few images (e.^.. Images 7 
and 27), their visual quality decreases only slightly or 
remains nearly the same if measured with other (more 
accurate) metrics like MS-SSIM and VIF. 

4 Conclusion and Future Work 

A method to recover DC coefficients from AC coefficients 
of DCT-transformed images, called the USO method, 
was proposed in [1 . In this work, we proposed an 
improved DC recovery method and called it the FRM 
method. Experiments showed the performance improve- 
ment of the FRM method over the USO method for 200 
test images in terms of the PSNR value and nine differ- 
ent IQA metrics. 

There are several ways to further improve the pro- 
posed FRM method. For instance, if we have some 
statistics of the image to be recovered, we may be able 
to get a better estimate of DC(5o) from the mini- 
mized underffow/overffow rate. We may define the un- 
derfiow/overfiow rate in a different way, e.g., modulat- 
ing the simple definition with the degree of the under- 
flow/overfiow. In addition, more advanced DC predic- 
tion and averaging schemes may be used by exploit- 
ing the prior knowledge on the underlying image. An- 
other possible improvement is to change the locations of 
the initial reference blocks and/or the scanning pattern. 
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Pr[A(PSNR)>0] = 0.735, Mean(A(PSNR)) = 1.57 



Pr[A(SSIM)>0] = 0.765, Mean(A(SSIM)) = 0.0248 
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Pr[A(MS-SSIM)>0] = 0.88, Mean(A(MS-SSIM)) = 0.0567 
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Pr[A(WSNR)>0] = 0.73, Mean(A(WSNR)) = 1.52 
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Pr[A(NQM)>0] = 0.935, Mean(A(NQM)) = 2.07 
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Pr[A(IFC)>0] = 0.895, Mean(A(IFC)) = 0.166 
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Pr[A(VIF)>0] = 0.925, Mean(A(VIF)) = 0.0317 
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Pr[A(VIFP)>0] = 0.925, Mean(A(VIFP)) = 0.0265 
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Pr[A(UQI)>0] = 0.82, Mean(A(UQI)) = 0.0345 
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Pr[A(VSNR)>0] = 0.85, Mean(A(VSNR)) = 2.04 
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Figure 6: The performance improvement of the proposed FRM method over the USO method using different IQA 
metrics. Note that the range of some metrics (SSIM, MS-SSIM, VIF, VIFP, UIQ) is [0,1], so the mean values 
corresponding to these metrics are relatively small. 



Theoretically speaking, any block can be the initial ref- 
erence block and any scanning pattern can be used. It 
may also be beneficial to have multiple initial reference 
blocks and scanning patterns. One of the implications 
of the DC recovery method is that a fewer amount of 
information about DC coefficients should be encoded in 
image compression. We will investigate this possibility 
as well in our future work. Yet another work of interest 
is to compare the performance of DC recovery methods 
by employing human experts, which will offer more con- 
vincing results than the objective IQA metrics used in 
this paper. 
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